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Abstract 

With advances in brain-computer interface (BCI) researcli, a portable few- or single-channel BCI system has become 
necessary. Most recent BCI studies have demonstrated that the common spatial pattern (CSP) algorithm is a powerful tool in 
extracting features for multiple-class motor imagery. However, since the CSP algorithm requires multi-channel information, 
it is not suitable for a few- or single-channel system. In this study, we applied a short-time Fourier transform to decompose a 
single-channel electroencephalography signal into the time-frequency domain and construct multi-channel information. 
Using the reconstructed data, the CSP was combined with a support vector machine to obtain high classification accuracies 
from channels of both the sensorimotor and forehead areas. These results suggest that motor imagery can be detected with 
a single channel not only from the traditional sensorimotor area but also from the forehead area. 
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Introduction 

The brain-computer interface (BCI) is a new communication 
scheme that depends on neither the brain's normal output nerve 
pathways nor the muscles. Using a BCI system, one can directly 
translate brain activities into sequences of control commands for 
an output device such as a computer application [1,2]. Motor 
imagery is a mental process by which an individual rehearses or 
simulates a given action in his/her mind but without actually 
producing movement; it is assumed to involve similar cortical 
areas that are activated during actual motor preparation and 
execution [3]. Motor imagery has been widely used as a major 
approach in BCI studies [4,5]. 

In most BCI research, whole-head multi-channel data are used 
to produce high accuracy. However, the large number of 
electrodes required implies a longer time spent in channel 
preparation. In addition, the BCI system may be expensive as 
many amplifiers are needed. As BCI research has advanced, 
portable systems with fewer channels have become essential in 
applying BCIs to everyday life and home applications. The 
preparation of the electrodes involves putting gel or paste on the 
scalp and fitting an electroencephalography (EEG) cap on the 
head. Additionally, the skin needs to be prepared to deal with the 
hair under the electrodes. By comparison, in long-term daily-life 
BCI usage, it is much easier to fit EEG electrodes on the forehead 
area because there is no hair in this area. Additionally, it is 
inconvenient and uncomfortable to place multiple electrodes on 
the scalp. A realistic solution is to place a few electrodes or a single 
electrode over the motor cortex or, since it is easier and more 
comfortable to place electrodes on the forehead to get the motor 
imagery signal from the forehead if possible. Thus, in this study. 



we hypothesize that if high classification accuracy can be obtained 
in motor imagery tasks using only a few EEG channels or a single 
EEG channel from forehead electrodes, then the use and 
application of a motor-imagery BCI system wiU be much easier 
and more convenient. 

Our hypothesis must address how to extract adequate and 
appropriate features of motor imagery from a system comprising 
few or a single channel. The common spatial pattern (CSP) 
method is commonly used for effective feature extraction [6-9]. 
The main idea of CSP method is to use a linear transform to 
project multi-channel EEG data into a low-dimensional spatial 
subspace with a projection matrix, of which each row consists of 
weights for channels. However, CSP can only be effectively used if 
there are many electrodes available [10]. Therefore, it is not 
appropriate to use CSP for a few- or single-channel system. 

Some previous research has focused on single-channel electro- 
corticography BCI [11]. MiUler-Putz et al. reported success in 
detecting foot motor imagery (one-class) employing single-channel 
EEG [12]. Pfurtschellers group [13,14] used one Laplacian 
channel (signals from the surrounding electrodes were used) to 
detect motor imagery. Some multi-channel BCI research has also 
attempted single-channel analysis, but the signals from the 
remaining channels were used during the analysis [15,16]. 

In this paper, we propose a method of using only single-channel 
EEG data to classify four-class motor imagery. We first decompose 
the single-channel EEG signal into the time-frequency domain. In 
the time-frequency domain, we treat the frequency band as a 
variable, and we thus have multi-channel time-varying inputs. 
With this transformation, the original single-channel input can be 
transformed into a multi-channel input. Therefore, CSP can be 
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used in feature extraction. To tlie best of our knowledge, this 
research is the first to address four-class motor-imagery BCI with a 
single-channel EEG. 

Methods 

Data acquisition 

In our retrospective study, we used the dataset Ilia from the 
2005 BCI competition provided by the University of Technology, 
Graz, Austria [17]. AH participant records and information used in 
this study were anonymous and were not identified in the dataset. 
The Ethics Committee of Southeast University approved our study 
protocol and methods before we conducted this research. This 
dataset comprises 60-channel EEG data for a four-class (left hand, 
right hand, foot, and tongue) classification task. The datasets were 
recorded for three participants, K3, K6 and LI, using a Neuroscan 
EEG amplifier. The left mastoid served as a reference and the 
right mastoid as the ground. The EEG was sampled at 250 Hz 
and filtered between 1 and 50 Hz. A notch filter allowed 
suppression of line noise. Sixty EEG channels were recorded 
according to the scheme in Figure 1 . 

The participants were seated relaxed in a chair with armrests, 
and were instructed to perform imaginary movements prompted 
by a visual cue. Each trial started with an empty black screen; at 
time point ? = 2 s a short beep tone was presented and a cross + 
appeared on the screen to catch the participants attention. Then, 
at / = 3 s, an arrow appearing for 1 .25 s pointed either to the left, 
to the right, upwards or downwards. Each position indicated by 
this arrow instructed the participant to imagine a left hand, right 
hand, tongue or foot movement, respectively. The respective 
imaginary movement was to last until the cross disappeared at 
t = 7 s (see Figure 2). The data set recorded from participant K3 
consisted of 9 runs, whereas the data sets from K6 and LI 
consisted of 6 runs each. Each of the four cues was displayed 1 0 
times within each run in a randomized order, and each trial lasted 
for 7 s. Trials with labels, which indicated that the trials had 
visually identified artifacts, were excluded from the input data for 
analysis. 



Nose 




Figure 1. Position of EEG electrodes. 
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Figure 2. Timing of the paradigm. 
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EEG electrode selection 

Previous knowledge tells us that the C3, Cz and C4 electrodes, 
which are over the sensorimotor area, record important charac- 
teristics of motor imagery [18,19]. In this study, we selected EEG 
data from C3, Cz and C4. Moreover, Fpl, Fpz and Fp2, which 
are over the forehead, were also used in this study. 

Time-frequency analysis 

The purpose of this study was to distinguish four-class motor 
imagery only using single-channel EEG data. Therefore, it was 
important to extract more information from single-channel data. 
In this study, we employed time-frequency analysis to obtain both 
temporal and frequency characteristics. By performing time- 
frequency analysis, a single time-varying signal can be converted 
into multiple time-varying signals at diflferent frequencies. Such a 
channel-increasing method allows past multi-channel BCI ap- 
proaches, such as the use of CSP, to be applied to the single- 
channel case. 

Short-time Fourier transform (STFT) analysis, wavelet trans- 
form (WT) analysis and HUbert-Huang transform (HHT) analysis 
are the most used time-frequency analysis methods. The time 
resolution of the WT is hundreds of milliseconds, with a central 
frequency below 20 Hz [20], while past motor imagery research 
has reported that the mu (8-13 Hz) and beta (16-25 Hz) rhythms 
served as effective classification features to distinguish motor 
imagery [21,22]. Further, empirical mode decomposition in HHT 
analysis often encounters such problems as mode mixing and 
ending effect, and is very sensitive to noise [23]. Compared with 
these two methods, STFT analysis has acceptable time and 
frequency resolution below 20 Hz. The most important point is 
that the calculation cost of the STFT is far lower than those of the 
WT and HHT. Thus, the STFT is a rehable method for BCI 
analysis. In this study, we used the STFT (spectrogram function of 
Matlab's Signal Processing Toolbox) for time-frequency analysis of 
single-channel EEG data, while a 50% overlapped Hamming 
window of size 128 samples was used, and the number of FFT 
nfft= 128 samples (each 100 original samples were zero-padded to 
128 points). Since the mu (8-13 Hz) and beta (16-25 Hz) 
frequency bands play a key role in classification of motor imagery 
[21,22], the 8-30 Hz frequency band was investigated. 

Feature extraction 

In most BCI research, the CSP is widely used to separate two 
diflferent classes. The idea behind using such a binary CSP is to 
find an optimal decomposition to transform two classes of data 
into a common space, in which the two classes of transformed data 
have the same principal components, and their corresponding 
eigenvalues add up to a unit matrix. The idea behind the CSP is to 
find a spatial filter that can be applied such that the projected 
signal has high power for one class and low power for the other. 
Here, the power in a trial is calculated using the variance in the 
time domain. The binary CSP can discriminate only between two 
diflferent classes (e.g., left versus right). For k-class paradigms, an 
extension has been proposed [24,25]: the basic idea is to 
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decompose the k-class problem into a set of k binary problems 
(right versus rest, left versus rest, etc.). Each problem consists of 
discriminating one class against the remaining classes (one versus 
the rest, OVR) [26]. 

Here, we will derive an OVR algorithm for the four-class case. 
We denote the STFT matrices X of a single-channel EEG signal 
for four different directions as Xi, X2, X^ and X4 with dimensions 
of M by A'^, where M and N are the numbers of frequency and 
time bands, respectively. The spatial covariance of STFT matrices 
for these conditions can therefore be estimated by 

Ci = XiXi^, /= 1,2,3,4 (1) 

where Xf denotes the transpose of Xj. As for the binary CSP, we 
can build the composite covariance matrix as 

C=Cl + C2 + C3 + C4 (2) 

The composite covariance matrix can be factored by eigen- 
decomposition as 



A-|-A'=7 (10) 

Combining equations (4) and (6)-(10), we have 

{Ui''P)Ci{Ui^Ff+{Ui''P)Ci{U/pf=I (11) 

Si and share common eigenvectors and the sum of 
corresponding eigenvalues for these two conditions will always 
be one. 

From equation (1 1), the variance accounted for by the 
eigenvectors corresponding to the m largest eigenvalues wiU be 
maximal for Si , and minimal for S>i . Therefore, the transforma- 
tion of the STFT matrix X onto eigenvector space will maximize 
the variance difference between 5*1 and 5j. The projection matrix 
Wi is 

Wi = Ui^P (12) 



C= UoAUo^ (3) 

where Uo is the M x M unitary matrix of prin( ij)al components, 
and A is the M x M diagonal matrix of eigenvalues. 
The whitening transformation matrix is then formed as 

P = A-i/2[/o^ (4) 

To see how to extract common spatial patterns specific to 
condition 1, we let 

Ci' = C2 + Q + C4 (5) 

Ci and Cfi are then individually transformed as 

Si=PCiP^ (6) 



A Im-hy-M spatial filter Wil was built with the first and last m 
rows of Wi . Then, the STFT matrix X is filtered with this spatial 
filter: 

Zi = WilX (13) 

The filtering of the STFT matrix X leads to a new time- 
frequency matrix Zi . The pattern is designed such that the Z\ that 
results from the X filtered with W\l has maximum variance for S\ 
and minimum variance for S^j. In this way, we can extract the 
common spatial patterns specific, to ^i; i.e., condition 1. 

In the same way as above, we can build spatial filters W2L, W^l 
and W^L to get the filtered time-frequency matrices Z2, Z3 and Z4 
for the remaining conditions 2, 3 and 4, respectively. 

Classification 

Feature vectors for four diflferent conditions are obtained: 



Si =PCi P' 



(7) 



It can be demonstrated that Si and S^ share common principal 
components [24] . If the eigen-decomposition of can be written as 



Si = [/iAit/i' 



(8) 



where U\ is the eigenvector matrix of Si, which corresponds with 
eigenvalue matrix Ai. Then S^ can be factored as 



Si =£/iAi Ui' 



(9) 



and the sum of the corresponding eigenvalue matrices Ai and Aj 
will be a unit matrix: 



/ \ 

VAR, 



/= 1,2,3,4 



(14) 



where VARf is the variance of Z,- among time points (l-by-2w). A 
composite feature vector (l-by-8w) is defined as: 



/=[fl/2/3/4] 



(15) 



As a state-of-the-art classification methodologv', the support 
vector machine (SVM) [27] has sound theoretical foundations and 
has served as a powerful tool for solving classification problems 
[28]. With respect to the recognition of a small sample of 
nonlinear and high-dimensional data, SVM has better adaptabil- 
ity, stronger classification ability and higher computational 
efficiency. In this study, we used the LIBSVM package [29] to 
implement SVM classification, and traditional C-support vector 
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classification (C-SVC) [30] was used as the support vector 
classifier. 

The basic idea of SVM is to look for the optimal decision 
hyperplane that best separates the data points into different classes 
with a maximum margin, while allowing errors during separation; 
i.e., map the input x onto a high-dimensional feature space 
{z = (^(x)) and construct an optimal hyperplane defined by 
wz — b = Q to separate examples into different classes, where w 
is the normal vector and b is the bias of the separation hyperplane. 
This is done by solving the primal problem: 



I ♦ C3 ■ Cz A C4 M Fp1 X Fpz » Fp2 -a-Avg| 

Subject: K3 

.»'«r .... 

< T xxx xx" 

0.50 I \ 1 1 1 \ 1 1 1 1 

1 23456789 10 



min -\\wf + cY,ii, 

^ /=i 
s.t. j-^w-z, -ft) >!-<,-, ^,>0 



(16) 



where x,- is the /-th input sample, j', is the class label value 
corresponding to Xi, n is the number of input samples, if, is the 
slack variable that allows an example to be in the margin 
(0 < <f,- < 1, also called a margin error) or to be misclassified > 1), 
and C is a penalty factor. 

The equation (16) can be solved by its dual problem using 
Lagrange optimization; i.e., we solve the quadratic programming 
(Q_P) problem 



max 1 ~ 7 yiyjaiajK ( 

s.t. }',fl, = 0, 0 < a, < C 



(17) 



where a,- is the Lagrange multiplier from the QP problem, and 
K(Xi,Xj) is the kernel function. 

Because of the nonlinear properties of EEG signals, in this 
study, the radial basis kernel function (RBF) is selected as the SVM 
kernel function: 



K{xi,Xj) =exp(-y||x,-x,f ), y>0 



(18) 



where y is the kernel parameter. The kernel parameter y and 
penalty factor C are the main parameters that affect the 
performance of the SVM. y decides the distribution of the 
transformed data in space, and the penalty factor C controls the 
degree of punishment for right or wrong classification, thus 
balancing classification violation and the margin. Therefore, y and 
C play an important role in improving the correct rate and 
classification efficiency of the SVM. In this study, the grid search 
method [31] was used to optimize y and C. To prevent the 
overfitting problem, we used a 10 x 10-fold cross-validation 
procedure. In this procedure, the training set is divided into 10 
subsets of equal size. Sequentially, one subset is tested using the 
classifier trained on the remaining nine subsets. The optimal y and 
C are obtained when the cross-validation accuracy is a maximum. 
The final classification accuracy is the mean result of the 10-fold 
cross-validation procedure. 

Results 

The main free parameter afiecting the classification accuracy is 
m, which is the number of projections to CSP used to buUd the 
feature vector. The classification accuracies for participants K3, 
K6 and LI with different m values were compared in the range 
from 1 to 10 (see Figure 3). According to the curve of averaged 



0 90 
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m 
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Figure 3. Accuracy values at different m values for participants 
K3, K6 and L1 . The mean accuracy value peaked at m = 7 for all 
three participants. 

doi:1 0.1 371/journal.pone.009801 9.g003 

accuracy, it was clear that the classification accuracy peaked when 
m = l for all three participants. 

Table 1 presents accuracy values for different time ranges and 
electrodes for participants K3, K6 and LI. The time ranges are set 
as four different ranges: 3~4, 4~5, 5~6 and 6~7 s. Table 1 also 
gives accuracy values for different EEG electrodes (i.e., Fpl, Fpz, 
Fp2, C3, Cz and C4) and for all the three participants. Two-way 
analysis of variance (ANOVA) was employed to investigate the 
effects of the time range and electrode selection. There were no 
significant difference for either the time range (P = 0.62) or the 
electrode selection (P = 0.91). 

Discussion 

Past work [7,15,32-36] used few- or single-channel EEG data to 
classify four-class motor imagery using the 2005 BCI competition 
dataset, which was used in our study. We list the classification 
accuracy results obtained in these studies in Table 2. 

Most past research used more than two electrodes to extract 
features. Only Schlogl et al. [15] used the best single channel of 60 
EEG channels for classification. However, since they used all 60 
channels of data and then picked the best single channel, their 
method differs from that of using only single-channel information 
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Table 1. Accuracy values for different time ranges and electrodes for participants K3, K6 and LI. 



Time Range 


CH 


Pot 


3 — 4s 


4-^55 






Av9 


Fpl 


K3 


0.52 


0.74 


0.58 


0.58 


0.63 




K6 


0.63 


0.83 


0.63 


0.63 






LI 


0.55 


0.58 


0.56 


0.78 




Fpz 


K3 


0.53 


0.60 


0.64 


0.58 


0.62 




K6 


0.55 


0.83 


0.53 


0.63 






LI 


0.78 


0.58 


0.67 


0.54 




Fp2 


K3 


0.56 


0.64 


0.73 


0.67 


0.65 




K6 


0.63 


0.78 


0.51 


0.63 






LI 


0.55 


0.75 


0.67 


0.67 




C3 


K3 


0.64 


0.85 


0.69 


0.64 


0.64 




IVD 


0.56 


0.73 


0.50 


0.63 






LI 


0.50 


0.58 


0.67 


0.67 




Cz 


K3 


0.75 


0.60 


0.56 


0.56 


0.64 




K6 


0.80 


0.60 


0.60 


0.57 






LI 


0.56 


0.67 


0.63 


0.75 




C4 


K3 


0.71 


0.64 


0.67 


0.70 


0.65 




K6 


0.88 


0.63 


0.63 


0.63 






LI 


0.56 


0.50 


0.63 


0.71 




Avg 




0.62 


0.67 


0.61 


0.64 





doi:l 0.1 371/journal.pone.009801 g.tOOl 



to detect motor imagery.The best classification result in past 
research was obtained by Li et al. [35], who used three combined 
channels and got 83.1, 84.4 and 85.6% for participants K3, K6 
and LI, respectively. 

Unlike past research that used at least two combined channels 
or selected the best channel from multiple channels, our method 
used only single-channel data to get 73.4, 78.3 and 75.2% from 



the Fp2 channel, and 71.3, 88.1 and 71.2% from the C4 channel 
for each participant respectively. This result is relatively better 
than the results of most of the previous studies. 

Although the averaged accuracy for the 4~5 s time range was 
considerably higher than that for other time ranges, the ANOVA 
result showed that there were no significant differences in other 
time ranges. Wang et al. [32] found that the best accuracy for 



Table 2. Best accuracy for different feature-extraction and classification methods. 



Feature Extraction 


Classifier 


Channel 


Accuracy(%) 












K3 


K6 


LI 


AAR 


MDA 


Best single channel of 60 ch [38] 


56.9 


46.5 


48.5 


AAR 


MDA 


Three best single channels of 60 ch [38] 


66.6 


38.5 


49.5 


CAR+CSP 


NN 


C3 and C4 [44] 


41.6 


41.7 


49.5 


Barlow method 


SVM 


C3 and C4 [12] 


53.3 


42.5 


55.8 


Barlow method 


SVM 


C3, Cz and C4 [1 2] 


63.3 


45.0 


60.0 


WPD 


ME 


C3, Cz and C4 [14] 


90.8 


66.0 


76.9 


WPD+CSP 


SVM+NN 


C3, Cz and C4 [27] 


83.1 


84.4 


85.6 


PLV 


SVM+Quicksort 


C3, Cz and C4 [22] 


86.0 


82.0 


77.0 


Sparse PCA+Sparse CSP 


SVM 


60 ch [39] 


85.1 


81.6 


80.1 


STFT+CSP 


SVM 


Fp2 [our Method] 


73.4 


78.3 


75.2 






C4 [our Method] 


71.3 


88.1 


71.2 



★Since there are four classes of imagery movements, the chance level is 25. 

AAR: adaptive autoregressive; MDA: minimum distance analysis; CAR: common average reference; CSP: common spatial pattern; NN: neural network; SVM: support 
vector machine; WPD: wavelet packet decomposition; ME: mixture of experts; PLV: phase-locking value; PCA: principal component analysis; STFT: short-time Fourier 
transform. 

doi:l 0.1 371 /journal.pone.009801 9.t002 
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different participants was obtained for different time ranges. Our 
result supports their conclusion. 

The ANOVA result for electrode selection verified that there 
were no significant differences in accuracies obtained with the 
electrodes used in this study. The accuracies obtained using C3, 
Cz and C4, which are over the sensorimotor area, were equivalent 
to those obtained using Fpl, Fpz and Fp2, which are far from the 
motor cortex. As a result of volume conduction [37], the local 
EEG activity field also produces a far-field potential [38] and the 
active potential will not only be recorded directiy above the 
generator but wUl also appear as a function of current spreading 
over the skuU and scalp [39] . Fried et al. [40] reported that the P14 
component, which is generated by the parietal lobe, can be 
similarly recorded by both parietal and frontal lobe electrodes. 
Nunez [41] reported high coherence of EEG channels over large 
distances. More direcdy, Li et al. [42] found high correlation in the 
event-related potential, frequency domain and event-related 
spectral perturbation between forehead-area EEGs and sensori- 
motor-area EEGs during a motor imagery task. These past studies 
and our result confirm that forehead EEG electrodes can be used 
to detect motor imagery equally as well as using traditional 
electrodes over the sensorimotor area. 

The CSP algorithm has been shown to be one of the most 
popular and efficient algorithms for BCI detection [6-9]. A 
disadvantage of the CSP method is the large number of electrodes 
needed [10]. The accuracy will be poor if the number of electrodes 
is insufficient [43]. In this study, employing the STFT, we 
transformed the time domain signal of a single channel into 
multiple frequency-domain signals. If we treat such multiple 
frequency-domain signals as a form of multi-channel information, 
the CSP can be applied to single-channel EEG. Using the STFT, 
the time-domain signal is converted to a time-frequency domain 
signal. Thus, the one-dimensional feature in the time domain is 
expanded to two-dimensional features in the time-frequency 
domain. Past studies have shown that the frequency feature plays 
an important role in BCI detection [14,35,43,44]. Our method 
expands time features to time and frequency features, allowing 
more feature vectors to be used in feature- d(;tection. In this study, 
we used such a method to examine the classification accuracies of 
different single electrodes. The results demonstrate that expansion 
of a single time-domain signal to multiple frequency-domain 
signals is an efficient approach to obtain high classification 
accuracy of motor imagery with a single-channel EEG. 

References 

1. Birbaumcr N (2006) Breaking the silence: brain-computer interfaces (BCI) tor 
communication and motor control. Psychophysiology 4.S: 517-532. 

2. Shih JJ, Krusicnski DJ, Wolpaw JR (2012) Brain-computer interfaces in 
medicine. In: Mayo Clinic Proceedings. Elsevier, volume 87, pp. 268-279. 

3. Jeannerod M (2001) Neural simulation of action: a unifying mechanism for 
motor cognition. Neuroimage 14: SI 03— SI 09. 

4. Lemm S, Schafer C, Curio G (2004) BCI competition 2003-data set HI: 
Probabilistic modeling of sensorimotor fl rhythms for classification of imaginary 
hand movements. IEEE Transactions on Biomedical Engineering 51: 1077— 
1080. 

5. Nicolas-Alonso LF, Gomez-Gil J (2012) Brain computer interfaces, a review. 
Sensors 12: 1211-1279. 

6. Koles ZJ, Lazar MS, Zhou SZ (1990) Spatial patterns underlying population 
differences in the background EEG. Brain Topography 2: 275—284. 

7. Shi LG, Li Y, Sun RH, Lu BL (201 1) A sparse common spatial pattern algorithm 
for brain-computer interface. In: Neural Information Processing Lecture Notes 
in Computer Science. Springer, volume 7062, pp. 725—733. 

8. Blankertz B, Miiller KR, Curio G, Vaughan TM, Schalk G, et al. (2004) The 
BCI competition 2003: Progress and perspectives in detection and discrimina- 
tion of EEG single trials. IEEE Transactions on Biomedical Engineering 51: 
1044-1051. 



Open Questions 

Compared with the traditional motor imagery research that is 
based on sensorimotor area EEGs, detecting motor imagery based 
on forehead area EEGs is a novel approach. From the perspective 
of convenience and comfort, forehead-type BCI systems may be 
highly possible and practical for usage in everyday life in the 
future. How(;ver, forehead area EEGs also inevitably involve 
electrooculography (EOG) and electromyography (EMG) signals. 
BCI research must ensure that it is only EEG signals, but not EOG 
or EMG signals, that play a key role in classification. Although, we 
had already tried to reduce EOG and EMG effects in our research 
by excluding visually identified artifacts, more research and 
discussion about this problem based on a large number of data 
is needed in the future. 

In this research, we selected sites C3, C4 and Cz near the 
sensorimotor area, which are considered to have a relationship to 
motor imagery and are widely used in BCI studies. Moreover, 
considering usage in everyday life, we also selected Fpl, Fp2, Fpz 
at the forehead area, which are easy to locate and set. Although 
higher classification accuracies were obtained from those elec- 
trodes in this study, it is hard to conclude that those electrodes are 
the optimum channel(s) for all other participants. Our research has 
just shown that these electrodes would be good candidates for 
single-channel BCI system. 

Another limitation of this study is that the dataset from the 2005 
BCI competition that is used in this research only contains 3 
participants. Further verification with more datasets is needed to 
demonstrate the robustness of our proposed method. 

Conclusions 

In this study, we applied STFT to decompose single-channel 
EEG signal into the time-frequency domain to construct multi- 
channel information. Based on these reconstructed data, we used 
CSP combined with a SVM to obtain equivalent high classification 
accuracies from both the sensorimotor and forehead areas, which 
suggests that motor imagery can be detected with a single channel 
not only from the traditional sensorimotor area but also from the 
forehead area. 
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